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The  purpose  of  this  thesis. is  to  formulate  a  model  for  a  bandlimited 
quantum  communications  channel  and  to  investigate  the  channel  capacity  of 
such  a  channel  in  the  absence  of  external  and  thermal  noise. 

The  channel  capacity  of  a  quantum  channel  is  affected  by  quantization 

in  two  important  ways.   The  mere  fact  of  quantization  itself,  i.e.,  that  a 

quantized  signal  is  restricted  to  a  finite  number  of  states  which  it  can 

assume,  limits  the  ability  of  the  signal  to  contain  information.   The  other 

factor  which  degrades  the  performance  of  a  channel  is  that  every  bandlimited 

signal  contains  an  irreducible  uncertainty  which  is  due  to  the  uncertainty 

principle.   This  thesis  represents  an  attempt  to  construct  a  reasonable 

model  which  takes  these  two  factors  into  consideration. 
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Every  signal  in  the  proposed  model  is  assumed  to  be  associated  with 
a  wave  function.  AH  of  the  observable  properties  of  a  signal  can  be 
found  from  its  wave  function,  and  therefore,  all  of  the  information  which 
a  signal  can  contain  is  embodied  in  its  wave  function.   Information  is  ex- 
tracted from  the  signal  by  obtaining  estimates  of  some  or  all  of  the  prop- 
erties of  its  wave  function.  The  wave  function  of  every  signal,  band- 
limited  to  a  bandwidth  W  and  approximately  timelimited  to  a  time  interval 
T,  can  be  expressed  as  an  expansion  of  2TW  +  1  prolate  spheroidal  wave 
funct  ions. 

The  prolate  spheroidal  wave  functions  are  eigenfunct ions  of  a  certain 
integral  equation.   The  integral  in  this  equation  can  be  represented  as  an 
operator  which  is  Hermitian,  and  corresponding  to  this  operator  is  an 
observable  of  the  signal.   In  this  model,  a  receiver  which  measures  this 
observable  for  each  received  photon  in  a  signal  can  form  an  estimate  of 
the  wave  function  of  the  signal.  The  received  signal  can  be  represented 
by  a  set  of  2TW  +  1  numbers  which  give  the  distribution  of  the  received 
photons  into  2TW  +  1  different  states.   In  the  absence  of  external  and 
thermal  noise  each  of  these  numbers  is  a  random  variable  with  a  Poisson 
distr ibut  ion. 

Bounds  for  the  channel  capacity  of  a  bandlimited  quantum  channel  in 
the  absence  of  external  or  thermal  noise  are  found.   The  upper  bound  is 
derived  from  a  procedure  commonly  used  in  statistical  mechanics.   The  lower 
bound  is  derived  using  the  Gaussian  approximation  of  a  Poisson  distribution. 
It  is  shown  how  the  quantum  channel  model  formulated  herein  lends  itself  to 
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analyses  which  include  the  effects  of  external  and  thermal  noise.   Channel 
capacity  computations  can  be  made  for  these  cases  if  the  statistics  of 
this  noise  is  known. 

The  quantum  nature  of  electromagnetic  signals  is  discussed  briefly 
in  general  terms. 


VI  I  I 


CHAPTER  1 

QUANTUM  EFFECTS 

Introduct  ion 

Shannon's  famous  formula  [l]  for  the  channel  capacity  C  of  a  con- 
tinuous channel  of  bandwidth  W,  signal  power  S;  and  white  thermal  noise 
power  N  is 

C  =  W  log  L±Ji  . 

N 

This  formula  applies  to  a  model  of  a  communications  channel  called  the 
continuous  channel.   In  this  model  the  signal  s(t)  and  noise  n(t)  are 
represented  by  continuous  functions  of  time.   The  quantized  nature  of 
energy  and  the  Heisenberg  uncertainty  principle  assure  us  that  the  con- 
tinuous channel  does  not  correctly  represent  an  electromagnetic  commu- 
nications channel  because  in  such  a  channel  both  signal  and  noise  are 
discrete  and  their  measurement  somewhat  uncertain. 

Even  though  we  assume  that  the  electromagnet  ic -Sfi  el  d  at  the  receiver 
is  continuous,  the  problem  of  determining  the  field  by  natural  means 
involves  the  quantization  of  nature  and  the  Heisenberg  uncertainty  prin- 
ciple as  it  applies  to  the  canonical ly  conjugate  variables,  time  and 
energy.   Since  we  cannot  identify  our  received  message  with  infinite 
resolution,  we  cannot  attain  an  infinite  rate  of  information  identifi- 
cation. There  is,  therefore,  an  irreducible  uncertainty  associated  with 
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every  electromagnetic  signal, and  it  is  the  purpose  of  this  study  to 
investigate  the  nature  of  this  uncertainty  and  to  determine  its  effect 
on  the  communication  of  information 

We  limit  ourselves  to  the  study  of  bandlimited  signals,   We  are 
not  concerned  with  any  contamination  of  the  signal  due  to  thermal  or 
external  noise  in  the  medium  of  transmission.   However,  it  is  shown 
how  the  procedures  and  results  of  this  study  may  be  extended  to  cover 
the  case  where  thermal  and  external  noises  are  present.   The  only 
noise  considered  is  the  quantum  noise  which  arises  from  the  fact  that, 
since  nature  is  quantized,  we  cannot  have  a  physical  disturbance  which 
has  precisely  the  characteristics  of  a  continuous  bandlimited  signal. 
Further,  the  Heisenberg  uncertainty  principle  imposes  an  additional 
uncertainty  in  the  detection  of  the  signal. 

The  development  of  the  laser  has  made  possible  the  generation  of 
phase  coherent  monochromatic  light.   The  photons  of  such  a  light  signal 
all  have  the  same  energy   Modulation  may  be  thought  of  as  a  process 
which  changes  the  energy  distribution  of  a  monochromatic  signal  whose 
photons  all  have  the  same  energy  to  a  signal  whose  photons  may  have 
different  energies.   For  example,  if  a  monochromatic  electromagnetic 
signal  is  modulated  with  a  sinusoid,  then  the  energy  of  the  modulated 
signal  is  known  to  exist  at  the  carrier  frequency  and  at  the  two  side- 
band frequencies.   The  modulation  process  has  redistributed  the  energy 
of  the  photons  so  that  some  will  be  found  to  exist  at  all  three  fre- 
quencies.  It  is  postulated  in  this  study  that  every  signal  of  time 
duration  T  and  bandwidth  W  consists  of  photons  which  may  be  in 
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2TW  +  1  different  states.  Any  two  signals  which  differ  from  each  other 
do  so  simply  because  the  distribution  of  their  photons  into  these  2TW 
+  1  states  differ. 

The  natural  language  for  such  a  study  is  the  formalism  of  quantum 
mechanics.   Gordon  [2]  and  Stern  [3]  have  applied  the  principles  of 
statistical  mechanics  to  the  channel  capacity  problem  with  considerable 
success.   The  results  found  here  frequently  agree  qualitatively  with 
theirs.   Gordon's  paper  contains  a  good  collection  of  references  of 
recent  work  in  this  area. 

Quantum  concepts  affect  communication  signals  in  two  important 
ways   The  first  of  these  is  the  mere  fact  of  quantization  itself.  The 
second  is  derived  from  the  Heisenberg  uncertainty  principle,  a  con- 
sequence of  which  is  that  measurements  performed  on  a  bandlimited  signal 
contain  an  irreducible  uncertainty.. 

To  illustrate  the  effect  of  quantization,  consider  a  signal  which 
may  assume  a  value  from  zero  to  99«   If  the  amplitude  of  the  signal  is 
continuous  and  may  assume  any  value  from  zero  to  99,  then,  in  the 
absence  of  noise,  the  signal  contains  an  infinite  amount  of  information. 
If  the  signal  is  quantized  so  that  it  can  assume  only  the  integer  values 
0,  1,  2,  ...,  99,  then  the  signal  can  contain  only  log  100  units  of 
information.   If  the  signal  were  quantized  to  levels  which  were  spaced 
so  that  it  could  assume  only  even  integers,  then  the  information  con- 
tent of  the  signal  would  be  reduced  to  log  50  units  of  information. 
Thus  the  mere  fact  of  quantization  affects  the  channel  capacity  of  a 
system. 


The  Heisenberg  uncertainty  principle  states  that  any  two  canonical ly 
conjugate  variables,  such  as  time  and  energy,  cannot  simultaneously  be 
measured  with  an  arbitrarily  small  error.  The  energy  in  a  signal  exists 
in  packets  or  photons  and  by  specifying  that  a  signal  is  bandlimited, 
we  fix  the  energy  of  the  photons  to  lie  in  a  certain  interval  with  prob- 
ability one-  Therefore,  since  the  uncertainty  in  the  energy  is  not  zero, 
the  uncertainty  in  the  time  cannot  be  zero.  This  concept  is  discussed 
in  detail  later,  for  the  major  topic  of  this  paper  is  to  investigate  and 
clarify  the  nature  of  this  uncertainty. 

The  work  in  this  study  leans  heavily  on  the  work  of  Slepian,  Landau 
and  Pollak  [k,    5,  6]  on  prolate  spheroidal  wave  functions.   These  func- 
tions have  certain  properties  which  make  them  ideally  suited  for  use  in 
the  expansion  of  bandlimited  functions. 


Out  1 ine 

This  thesis  consists  of  six  chapters  and  three  appendices. 

Chapter  1,  the  present  chapter,  contains  an  introduction  and  an 
outline  of  this  thesis. 

Chapter  2  introduces  the  model  of  the  communications  system 
which  we  postulate.   It  discusses  briefly  the  concept  of  the  wave 
function  of  a  signal  and  shows  how  the  wave  function  can  be  expanded 
in  terms  of  the  eigenfunct ions  of  a  certain  integral  equation. 


Chapter  3  contains  a  brief  discussion  of  the  concept  of  an  observ- 
able and  of  how  an  observable  is  represented  in  quantum  mechanics.  The 
nature  of  the  wave  function  and  its  relationship  to  observables  are 


investigated.   Vector  notation  is  introduced. 


In  Chapter  k   a  quantum  channel  is  introduced  which  expresses  the 
properties  of  the  communications  channel  of  Chapter  2  in  the  language 
of  the  quantum  mechanics.  The  number  of  photons  observed  in  each 
degree  of  freedom  of  the  channel  is  shown  to  have  a  Poisson  distribu- 
tion. An  upper  and  lower  bound  on  the  channel  capacity  of  the  quantum 
channel  is  found  and  illustrated  graphically  for  several  cases  of 
interest. 

In  Chapter  5  the  inherent  ambiguity  of  signals  is  discussed  in 
general  terms.   It  is  suggested  that  the  concept  of  signals  as  con- 
tinuous functions  of  time,  a  concept  which  in  the  past  seemed  indigenous 
to  communications  theory,  may  lead  to  erroneous  results  in  some  cases. 

Chapter  6  contains  a  summary  of  conclusions. 


CHAPTER  2 

WAVE  FUNCTIONS  AND  PROLATE  SPHEROIDAL  FUNCTIONS 

A  Communications  Channel 

The  communications  channel  model  used  in  this  analysis  is  the  one 
shown  in  Figure  I.  The  transmitter  transmits  a  signal  consisting  of  a 
large  number  of  photons..  The  receiver  receives  N  of  these  photons, 
analyzes  them,  and  attempts  to  determine  what  signal  was  transmitted. 
From  Figure  1  it  is  clear  that  the  energy  propagated  in  the  y  and  z 
directions  is  of  no  interest  because  only  energy  propagated  in  the 
negative  x  direction  reaches  the  receiver.  This  thesis  is  valid,  how- 
ever, even  if  all  of  the  transmitted  energy  is  analyzed  by  the  receiver, 
The  negative  x  direction  is  used  as  the  direction  of  propagation  simply 
because  it  leads  to  a  desirable  sign  convention.   The  signals  of  the 
model  postulated  are  polarized  in  only  one  direction  and  are  not  dis- 
torted by  external  or  thermal  noise. 


Wave  Functions 

In  quantum  mechanics  a  wave  function  is  usually  defined  to  be  a 
solution  of  a  Schrodinger  wave  equation.  The  propagation  of  light  is 
not  governed  by  a  Schrodinger  wave  equation,  and  therefore,  light 
quanta  do  not  have  wave  functions  in  the  above  strict  definition  of 
the  term. 
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For  the  purposes  of  this  thesis,  however,  we  will  assiqn  to  everv 
signal  or  beam  of  photons  a  function  ¥(x,  y,  z,  t) ,  which  we  will  call 


a  wave  function  even  though  it  is  not  a  solution  of  any  Schrodinger 
wave  equation.  This  function  is  called  a  wave  function  mainly  because 
it  possesses  many  of  the  properties  of  the  more  conventional  wave 
function..  The  receiver  observes  the  signal  as  an  ensemble  of  photons, 
and  the  wave  function  of  this  ensemble  is  ¥(x,  y„  z,  t).   Since  the  y 
and  z  variations  are  not  detectable  by  the  receiver,  the  wave  function 
of  interest  is  the  one  dimensional  one  ¥(x,t).   Only  one  polarization 
is  being  considered  in  this  analysis. 

It  is  not  the  purpose  of  this  study  to  discuss  quantum  communi- 
cations in  terms  of  quantum  field  theory,  although  such  an  endeavor 
seems  worthy  of  investigation.  We  assume  that  the  transmitter  is  de- 
signed so  that  it  will  emit  a  one  photon  state  of  the  quantized  electro- 
magnetic field.  The  position  representation  of  quantum  mechanical 

— >  — > 
states  is  the  usual  magnetic  vector  potential  A(r,  t) ,  and  the  fields 

derived  from  this  potential  obey  Maxwell's  equations  and  the  second 

order  wave  equation.   In  this  study  we  are  taking  into  account  the 

effect  of  the  uncertainty  principle  by  realizing  that  we  cannot  know 

that  we  have  a  one  photon  state  and,  at  the  same  time,  know  the  position 

of  this  photon.   Strictly  speaking,  there  is  no  wave  function  in  quantum 

field  thoery,  but  the  function  A(r,  t)  is  analogous  to  the  wave  function 

Sf  used  in  this  paper  and  f A*( r,  t)  |  gives  the  probability  density  for 

— > 
finding  photons  at  r  and  t. 


Undispersed  Propagation 

Propagation  of  a  photon  in  free  space  is  an  example  of  undispersed 
propagation  {7].  This  is  true  because  the  velocity  of  propagation  c  is 
independent  of  the  frequency  of  the  photon.   Thus,  if  a  square  pulse  of 
photons  is  created  at  the  transmitter,  the  pulse  retains  its  shape  as 
it  propagates,  and  it  is  received  as  a  square  pulse.  The  analyses 

which  follow  can  be  generalized  to  apply  to  any  signal  which  is  undis- 

persedo 

The  time  independent  wave  function  is  obtained  from  J (x,  t)  by 

setting  t  =  0  and  getting  ^(x,  0).  Y(x,  0) ,  or  simply  ^(x),  and  its 

Fourier  transform  a(k)  are  related  by 

00 
¥(*)  "  T-  /  a(k)  ejkx  dk  , 

(2.1) 

00 
a(k)  =  /  Y(x)  e-Jkxdx  . 

-00 

a(k)  is  sometimes  written  as  a(p)  where  p  =  fik  and  is  called  the 
momentum  wave  function,   k  is  the  wave  number.  Y(x)  and  a(k)  are,  in 
general,  complex.   Usually,  it  is  the  custom  in  the  literature  of 
physics  to  define  the  relationship  between  f (x)  and  a(k)  in  terms  of 
the  symmetric  transform  pair  which  has  a  factor  1//Tjr  in  front  of  both 
integrals  of  (2.1).  We  define  our  transform  as  in  (2.1)  because  it 
allows  us  to  use  the  prolate  spheroidal  wave  functions  unaltered  from 
the  literature  [4]  on  this  subject,, 
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The  physical  interpretation  of  the  wave  function  ^(x)  of  a  non- 
relativistic  particle  is  that  if  the  particle  has  wave  function  ^(x) , 
the  probability  density  that  it  will  be  found  at  the  position  x  is 
|¥(x)|   [8,  9],   Similarly,  the  probability  density  function  that  the 
wave  number  of  the  particle  is  found  to  be  k  is  |a(k) |2„  The  wave 
function  for  a  photon  signal  has  a  similar  interpretation.   |y(x)|2 
is  the  probability  density  function  for  finding  photons  at  x.   Simi- 
larly; the  probability  density  function  for  finding  photons  with  wave 

2 

number  k  is  |^(k)  |  „ 

If  the  propagation  is  undispersed,  then  the  relationship  between 
x  and  t  is  x  =  xQ  +  ct  and  ^(x,  6)   =  y(xQ   +  ct ,  0) .   We  have  assumed 
propagation  in  the  negative  x  direction,  which  conforms  to  the  con- 
vention of  Figure  I.   By  a  correct  choice  of  axis  we  can  let  xq  =  0 
and  speak  of  the  wave  function  as  f(ct,  q)  or  simply  Y(ct).   Making 
the  change  of  variables  x  =  ct  and  k  =  to/c  in  (2.1)  gives 


,  00 

*(ct)   =  JL    /     a(^)    eJU>t     *2  , 
<■%  JL         c  c 


00 

/  di\  c  ,         .  -J  COt 

'(7)    =     /     ¥(ct)    e  dct 


Normalizing  these  so  that  |^(t)|2  and  |a(u)|2  represent  probability 
density  functions  gives 
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*(0  =  i  /  a(u)  ejWt  do), 

Z  Jt  -oo  * 


(2.2) 


Thus,  the  probability  density  function  for  the  detection  of  photons 
2 
at  time  t  is  |f(t)|  .   The  probability  density  function  that  photons 

2  2 

have  frequency  u  (or  energy  -fioo)  is  |a(w)  |  .  The  integral  of  |f(t)| 

over  a  certain  time  interval  is  proportional  to  the  number  of  photons 
detected  in  this  time  interval. 


Bandlimited  Representation  in  Terms 
of  Eigenfunct ions 

The  quantities  of  most  interest  to  us  in  what  follows  are  ^(t)  and 
a(u),  and  we  will  refer  to  these  as  the  wave  function  and  the  frequency 
wave  function,  respectively.   It  is  expedient  in  quantum  mechanics  to 
represent  f (t)  in  terms  of  a  series  of  orthonormal  functions,  and  this 
procedure  will  be  followed  here.   The  set  of  orthonormal  functions  to 
be  used  is  the  prolate  spheroidal  wave  functions  of  Slepian,  Landau, 
and  Pollak  [k,    5,    6].  Appendix  A  is  devoted  to  a  brief  explanation  of 
the  origin  and  principal  characteristics  of  these  functions. 

The  spectrum  of  the  prolate  spheroidal  wave  functions  is  confined 
to  the  bandwidth  (-fl,fl  ),  which  is  centered  on  go  =  0.  These  functions 
are,  therefore,  video  type  functions,  and  the  analyses  in  this  chapter 
are  concerned  only  with  these  video  functions.   The  usual  electromagnetic 


a(w)  =  /*(t)  e"J'utdt  . 
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communications  signals  are  narrow  band  signals  whose  frequency  spectra 
are  zero  outside  of  a  narrow  band  of  frequencies.   Such  signals  are 
called  bandpass  signals.  Appendix  B  shows  how  the  concepts  in  this 
chapter  are  extended  to  cover  the  bandpass  case. 

The  prolate  spheroidal  wave  functions  are  a  set  of  real  functions 
of  the  real  variable  t  which  satisfy  the  following  three  equations: 


(A-l) 


(A-2) 


I    *|<0*j(t)  dt-  6..  , 

T 

/  ¥,(t)  ¥ .(t)  dt  =  \.  6..  , 
T  '    J        •   'J 


(A-3) 


I 

2 

\|»,(t)  =  /  siny  (t  -  s)  ¥,(■)  ds  , 
T     «(t  -  s) 
-7 


where  the  5--  is  the  Kronecker  delta  function.  The  \;  and^j(t)  are 
the  eigenvalues  and  eigenfunctions,  respectively,  of  the  integral 
equation  (A-3).   Every  fj(t)  is  bandlimited  to  the  band  (-fi.fi),  i.e., 
the  Fourier  transform  of  every  prolate  spheroidal  wave  function  is 
zero  outside  the  interval  (-a,^). 

Expressing  a  wave  function  ¥(t)  as  a  series  of  these  eigenfunctions 
gives 


(2.3) 


*(t)  -  ^oW    +  9j¥i(t)  +  a2«2(t)  + 


=  £  a,?,(t)  , 
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where  the  aF  may,  in  general,  be  complex,   The¥j(t)  are  all  real. 
The  completeness  of  the  set  of  prolate  spheroidal  wave  functions 
assures  us  that  every  function  limited  to  the  band  (-fl.G)  can  be 
expressed  in  a  series  of  the  form  (2„3).   We  will  deal  only  with  Y(t) 
which  are  normalized  so  that  |^(t)|2  is  a  proper  probability  density 
function.   This  requires  that 

(2^)  7   hr(t)|2  dt  =  i  , 

m     CO 

and  therefore  from  the  orthonormal ity  relation  (A-l) 


(2.5)  y  |ai|2 


A  Theorem  of   Landau   and  Pollak 

r 

The  great  utility  of  this  set  of  funtions  is  embodied  in  the 
following  theorem  stated  by  Landau  and  Pollak  [6].   The  theorem  is 
slightly  paraphrased  for  our  use^ 

Theorem-   Let  f(t)  be  any  function  of  time  whose  spectrum  is  zero 
outside  the  interval  (-ft, p.),  and  let  ^(t)  be  normalized  so  that 


/  l¥(t)|2  dt  =  1 


If 

T 
7. 


(2-6)  /  |y(t)|2  dt  -  1  -  e2  , 


T 


T 
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then 


oo         2TW        2  ? 

(2.7)      INF    /  |¥(t)  -   S  a.Y.(t)[   dt  <  12  e   , 
{a;}   "oo  i«0  T 


where  the  f (t)  are  the  prolate  spheroidal  wave  functions,  and  W  =  — . 

2« 
Equation  (2.7)  states  that  a  set  of  complex  numbers  (aQ,  a,,  ..., 

a2TW  )  can  always  be  found  such  that  the  integral  on  the  left  side  of 
(2.7)  is  less  than  12  €2 

According  to  the  postulates  of  our  model  there  is  a  one-to-one 
correspondence  between  the  frequency  of  a  photon  and  a  frequency  lying 
in  the  band  (-fi,fl).  This  means  that  the  counterpart  frequency  of  each 
photon  lies  inside  the  band  {-a, a)    with  probability  one.   Thus,  |a(u) |2 
must  be  zero  outside  the  interval  (-fl.fl).  ¥(t)  is,  therefore,  bandlimited 
to  (-a, a).      The  interpretation  of  a(co)  for  negative  u  wi  1  1  be  discussed 
in  the  sequel , 

One  way  to  state  that  a  signal  is  approximately  limited  to  the 
time  interval  (-T/2,  T/2)  is  to  state  that  the  probability  is  small 
that  photons  will  be  received  outside  that  interval.   A  precise  state- 
ment of  this  is  equation  (2.6),  which  states  that  if  a  signal  has  a 

wave  function  which  satisfies  (2.6),  then  the  probability  of  receiving 

2 
photons  outside  the  interval  (-T/2,  T/2)  is  e_.   Thus,  if  N  photons  are 

2 

received,  then  all  but  about  N€  will  be  received  in  the  time  interval 

(-T/2,  T/2). 

2 

If  e~    is  small, then  a  signal  consisting  of  a  large  number  of 

photons  is  approximately  timelimited  to  the  interval  (-T/2,  T/2) „ 
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Equation  (2,7)  states  that  if  a  bandl imited  function  is  approxi' 
mately  timel imited  in  accordance  with  equation  (2.6),  then  such  a 
function  can  be  approximated  in  the  mean  square  sense  by  a  truncated 
series  of  2TW  +  1  prolate  spheroidal  wave  functions.   Thus,  we  can 
represent  any  such  wave  function  by  a  series  of  the  first  2TW  +  1 
prolate  spheroidal  wave  functions 

2TW 

(2-8)  f(t)  =  £   ■f*|(t)  . 

i=0 

and  the  mean  square  error  of  this  representation  can  be  made  less 

2 

than  126  with  the  proper  choice  of  the  a;,  It  is  shown  in  \.k]    that 

this  representation  becomes  more  exact  in  the  interval  (-T/2,  T/2) 
as  T  gets  larger. 


CHAPTER  3 

OBSERVABLES 

Hermitian  Operators 

According  to  quantum  mechanics  [7,  8,  9],  every  physically  measurable 
property  of  a  signal  corresponds  to  a  Hermitian  operator,  such  that  the 
eigenvalues  of  the  operator  are  the  possible  results  of  a  measurement  of 
this  observable. 

A  Hermitian  operator  is  an  operator  R  which  satisfies  the  equation 

(3.1)  (*,  R0)  =  (R¥,  0)  , 

where  (f,  g)  means  the  scalar  or  inner  product  and  is  defined  by 

i 
CO 

(f.  g)  =  /  f"  9  dt  . 

-  to 

Consider  the  integral  operator  of  equation  (A-3)  ,  namely 
T 

(3.2)  R.  /*'"?(*  -0  t   }  ds  . 

T   a  (t  -  s) 

"2 

The  following  equation  shows  that  the  eigenfunct ions  of  the  operator 
R  are  the  prolate  spheroidal  wave  functions  and  the  eigenvalues  are  the 
corresponding  set  of  numbers  \g,  k, , . . .  ; 
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T 
7 

(3.3)  R  ¥,(t)  -  /  sin  Sfe   -   s>  ^,(5)  ds  =  \,Yj(t) 

_T   ir  (t  -  s) 
2 


That  the  operator  R  is  Hermitian  in  the  space  of  prolate  spheroidal 
wave  functions  is  verified  as  follows.   Let 


W  «£    aiVt} 


and 


*(t)  =y  bjY.(t) , 

then  {y,    R<t>)    =   ()   a,Y,,    R  ^  b;^F ) 


X    a^ib.X.    , 

I 


I'M.     i ' 


which  agrees  with 


(m,   <J>)   =»   (R^a,^;,      ^bjYj) 
i  i 

■    (}_,    aiXiYi»2JbIYj) 
1  i 

=    )     a  fb,\,    . 
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The  theory  of  measurements  further  states  that  if  we  measure  the 
observable  R  of  a  system  whose  wave  function  is 


?(t)  =  ^  ajf,(t)  , 
i 

2 
then  the  probability  that  the  result  is  \Q  is  |aQ|   and,  in  general,  the 

probability  that  the  result  is  \.  is  |a,|  .   Therefore,  if  we  measure  R 

for  a  large  number  N  of  photons, the  expectation  of  the  number  of  times 

2 
we  observe  X^    is  N | a^ |   for  k  =  0,  1,  ...,  2TW. 

We  have  already  suggested  that  the  wave  function  of  a  signal  con- 
tains all  the  information  which  a  signal  can  have,  and  that  the  wave 
function  of  a  signal  describes  it  completely.   The  above  analysis,  there- 
fore, suggests  a  model  for  a  receiver*  Suppose  a  receiver  measured  the 
observable  R  on  N  different  photons  and  measured  \q  hq   times,  \,  n,  times, 

and  in  general,  A.,  n,  times.  The  quantities  n0/N,  nj/N,  ...  would  give 

2      2 
estimates  of  the  quant  it ies  |ag|  ,  \a^\   .....  This  receiver  is  explained 

in  greater  detail  in  the  next  chapter. 


The  Dimensionality  of  the  Wave  Function 

The  general  theory  of  quantum  mechanics  allows  ¥(t)  to  be  complex. 
Since  the  set  ¥j(t)  consists  of  real  functions  only,  this  means  that  the 
coefficients  a;  in  (2.3),  or  (2.8)  in  the  approximately  timelimited  case, 
are  generally  complex.   It  is  the  purpose  of  this  section  to  show  that  a 
wave  function  representing  undispersed  wave  motion  can  be  uniquely  deter- 
mined  by  a  set  of  2TW+1  real  numbers. 
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The  sampling  theorem  states  that,  if  a  signal  of  bandwidth  W  is 
properly  sampled  at  a  rate  of  2W  samples  per  second,  then  knowledge  of 
the  sample  values  is  sufficient  to  reconstruct  the  signal  exactly.  This 
implies  that  a  signal  limited  to  bandwidth  W  and  time  duration  T  has 
about  2TW  independent  variables  or  degrees  of  freedom.   Examination  of 
(2.8)  shows  that  if  the  real  and  imaginary  parts  of  each  aj  were  all 
independent,  then  the  wave  function  representing  a  bandlimited  signal 
would  have  about  4TW  degrees  of  freedom.   If  the  imaginary  part  of  each 
a.  is  determined  by  the  real  part,  however,  then  there  are  only  2TW 
degrees  of  freedom,  as  the  past  has  led  us  to  believe. 

The  determination  of  the  number  of  degrees  of  freedom,  for  a  wave 
function  ¥(t)  is  embodied  in  the  interpretation  of  the  frequency  wave 
function  a(w)  for  negative  frequencies.   a(u))  is  derived  from  a(k)  or 
a(p) ,  and  a(p)  does  have  meaning  for  negative  p.  |a(p) | "  for  negative 
p  is  simply  the  probability  density  function  for  a  measurement  of  momentum 
in  a  direction  in  which  the  momentum  p  is  considered  negative.   The  equiva- 
lent interpretation  for  a(co)  for  negative  oo  is  that  |a(w)  |   for  negative  cj 
is  the  density  function  for  a  measurement  of  the  frequency  of  a  particle 
or  photon  traveling  in  the  direction  from  receiver  to  transmitter.   In  a 
practical  communications  system  the  signal  of  interest  travels  from  trans- 
mitter to  receiver  and  a(w)  is  zero  for  negative  to.   With  this  in  mind, 
let  us  define  a  wave  function  ¥(t)  whose  spectrum  a(u)  is  zero  outside  the 
interval  (0,fi).   Such  a  function  can  be  represented  as 

(3.4)  ¥(t)  =  4>(t)  +  j£(t)  , 


■ 
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where  <t>(t)  is  a  real  function  whose  Fourier  transform  is  a(co)  defined  by 
(3.5) 


a(co)   ■   1/2  a  (to) 

u)  >  0 

=  0 

co  =  0 

=   1/2  a*(-w) 

co  <  0 

0(t)  is  also  a  real  function  whose  Fourier  transform  a'(oo)  is  given  by 

(3.6)  a '(to)  *  -  1/2  j  a(to)  u  >  0 

=  0  co  =  0 

=  1/2  j  a*(-u)  co  <  0   . 

The  function  <J>(t)  is  the  Hilbert  transform  of  <t>(t),  and  the  relationship 
between  <J>(t)  and  <l>(t)  is  given  by 


00    ^> 


♦  (t)    -    -Iflfel      dT 

t-oo't-T 

(3.7) 

♦(t)  ,  1  /  idldr   . 

71    -oo    t"T 

The  integrals  are  taken  as  Cauchy  principal  values.   A  good  explanation 
of  the  use  of  Hilbert  transforms  in  communications  theory  is  given  by 
Dugundj i  [10].   (DugundJ i  defines  his  transforms  as  the  negative  of  the 
way  they  are  defined  classically,  as  in  Titchmarsh  [ll]  for  example.  We 
use  Dugundj i's  sign  convention  mainly  because  we  can  use  and  refer  to 
his  work  on  spectrum  analysis  without  altering  the  signs  of  his  results. 
The  results  reported  in  Dugundj i's  paper  are  easily  derived  from 
Titchmarsh's  analysis  of  Hilbert  transforms.) 
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We  see  from  (3.4)  that  the  real  part  of  the  wave  function  f(t)  deter- 
mines the  imaginary  part  through  the  Hilbert  transform  relation  or  through 
the  relationships  of  their  spectra.   Now  the  spectrum  of  4>(t)  is  limited 
to  the  interval  (-fl.fl),  and  we  have  already  shown  that  such  a  function  can 
be  approximated  in  the  time  interval  (-T/2,  T/2)  by  the  series 

2TW 
(3-8)  <>(t)  =V  bjY;  (t)  . 

i=0 

A 

Similarly,  *(t)  can  be  expressed  as 

2TW 


(3-9)  $(t)  m\    df^j(t) 


1-0 


An  examination  of  the  functions  a(co)  and  a'(u)  shows  that  <J>(t)  and 
<t>(t)  are  both  real;  therefore,  the  coefficients  bj  and  d;  are  both  real 
and  ¥(t)  can  be  written  as 


2TW         2TW  2TW 

i=0  i=0  i=0 


- i  w  e.  I  w  ZTW 

(3.10)    f'(t)  -£  a.¥.(t)  .£  (b.  +  jd.)  f,(t)  -V  A,, 


The  fact  that  ¥(t)  is  normalized  in  accordance  with  equation  (2.4)  leads 
to 

2TW  2TW 

(3.H)  £  (bf+dfj'-V  A?-  1  . 


i=0  i=0 
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Titchmarsh  L11J  shows  that,  since  4>(t)  and  <t>(t)  are  Hilbert  transforms, 


00  00 

(3.i2)  /  |*(t)|2  dt  =  /  |$(t)|2  dt 

-Cv  _00 

Therefore, 

2TW  2TW 

1-0  i=0 


Since  <t>(t)  is  real  and  is  normalized  by  equation  (3.13),  it  has 
only  2TW  degrees  of  freedom,  and  consequently  ¥(t) ,  which  is  determined 
once  4>(t)  is  known,  has  only  2TW  degrees  of  freedom.  Thus,  an  ordered 
set  of  2TW  real  numbers  (bQ,  b, ,  b2,  ....  b2TW)  with  the  restriction  of 
(3.13)  is  sufficient  to  determine  ¥(t)  uniquely. 


Vector  Notation 

- 

It  is  convenient  to  represent  wave  functions  and  operators  in 
terms  of  vectors  in  a  linear  vector  space.   (Most  quantum  mechanics 
texts  treat  this  topic  with  some  detail  [12].)   In  this  scheme 

2TW 
(2.8)  ¥(t)  -V  afY.(t) 

i=0 

is  represented  in  vector  notation  as 

(3.14)  ¥  -  (an,  aj  ,  ...,  a2TV/)  • 
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In  these  two  equations  there  are  only  2TW  +  1  coefficients  a;,  because 
we  are  representing  bandlimited  functions  which  are  t imel imi ted  in  the 
sense  of  Landau  and  Pollak's  theorem.   In  the  general  case  the  number 
of  coefficients  aj  is  unlimited. 

The  basij  vectors  for  the  vector  space  of  all  bandlimited,  approxi- 
mately timelimited  wave  functions  are,  of  course,  the  2TW  +  1  prolate 
spheroidal  wave  functions,  and  these  are  represented  in  the  vector  space 
as 

¥0  =  (1,  0,  0,  ....  0), 


f,  =  (0,  1,  0,  ....  0), 


(3.15) 


*2TW  '  (°'  °*  ° ]) 


The  operator  R  is  represented  by  the  Hermitian  diagonal  matrix 


(3.16) 


\0  0  0  ...  0 
0  \,  0  ...  0 


0  0  0  ...  X 


2TW 


Equation  (3-1)  is  easily  verified  in  the  vector  notation  as  is 
(3.3)  which  is  written  simply  as 
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(3.17)  M.     =    \j¥; 


When  expressed  as  in  equation  (3*14)  the  1    is  usually  called  a  state 
vector.   The  f|  of  (3.15)  are  called  eigenstates  or  eigenvectors. 

Observabl es 

Max  Born  [13]  has  lucidly  stated  the  interpretation  of  the  wave 
function  or  state  vector  somewhat  differently  than  was  stated  at  the 
beginning  of  this  chapter.   What  follows  in  this  paragraph  is  his  state- 
ment slightly  paraphrased  to  adapt  it  to  our  situation.   To  each  physical 
quantity  or  observable  belongs  a  real  Hermitian  operator  R.  The  eigen- 
functions  ¥q,  ¥,,.,.  (or  eigenstates  in  vector  notation)  correspond  to 
the  quantized  or  pure  states,  for  which  the  operator  takes  on  the  eigen- 
values \q,  \, ,  ....  Any  wave  function  ¥  represents  a  state  which  can  be 
considered  as  a  mixture  of  pure  states.   The  coefficients  a.  of  the  ex- 
pansion (2.8)  or  (3.14)  determine  the  strength  with  which  the  quantum 

state  Y.  occurs  in  the  general  state  f.  The  probability  of  finding  the 

2      2 

eigenvalue  \.  in  a  certain  measurement  is  given  by  |a||"  or  A;  . 

Thus,  as  far  as  the  observable  associated  with  the  operator  R  is 
concerned,  a  signal  in  the  general  state  ¥  consists  of  N  photons  each 
of  which  may  be  found  in  one  of  2TW  +  1  different  states  represented  by 
the  f..      In  this  study  we  have  constructed  a  model  in  which  information 
is  derived  from  the  observable  of  the  signal  represented  by  the  operator 
R.   The  exact  physical  description  of  this  observable  is  a  subject  for 
further  study. 


CHAPTER  k 

QUANTUM  CHANNEL  MODEL 

Messages  and  Signals 

The  discussion  in  the  preceding  chapters  suggests  the  following 
model  for  a  communications  system.   Some  sets  of  positive  real  numbers 
A  =  (AQ,  A,,  ...,  A2TW)  represent  messages  with  the  restriction  that 

2TW 

i=0 
Associated  with  each  such  message  there  is  a  bandl imited  wave  function 

¥  =  (a0'  al a2TVp  wnere  A|  "  la;  I  •   I"  order  to  transmit  the 

message  A  =  (A0,  A] A2TV/) '  tne  transmitter  simply  transmits 

energy  with  the  corresponding  wave  function  1   =  (an.   a,      a  ) 

v  0   1 '  *  ' '   2TVr ' 

According  to  our  model,  a  signal  consists  of  photons  each  of  which 
may  be  found  in  one  of  2TW  +  1  states  and  the  distribution  of  the  photons 
into  these  states  is  determined  by  the  quantity  A.  The  function  of  the 
receiver  is  to  determine  the  number  of  received  photons  in  each  state 
and,  from  this,  to  estimate  the  distribution  function  A  of  the  signal. 

The  receiver  receives  N  photons  and  performs  the  measurement  R  for 
each  photon.  The  result  of  this  measurement  on  each  photon  is  one  of 
the  eigenvalues  \Q,   \,  ,  ...,  \2TVT  Thus»'f  the  receiver  measures  K-f    n. 
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times,  this  means  that  nj  of  the  N  photons  were  found  to  be  in  state  i. 
The  receiver  determines  the  set  of  numbers  n  =  (n0,  nj  ,  . ..,  r^jy)  and 
this  set  represents  the  number  of  photons  found  in  each  of  the  2TW  +  1 
states.  The  restriction  on  this  set  is 


2TW 


(*.2) 


I    "'  =  "' 


i=0 


Since  each  n;  is  a  random  variable  with  mean  A.N  the  quantity  Wn./N 
can  be  used  to  estimate  A..   On  the  basis  of  n,  which  we  will  call  the 
received  signal,  the  receiver  attempts  to  estimate  the  transmitted 
message  A  =  (A  ,  A  ,  ...,  a2jvP  ' 


The  Poisson  Distribution  of  the  n; 


Each  of  the  n-  are  random  variables.   If  N  were  a  constant,  then 
the  set  (nQ,  n, ,  ....  n2TW)  would  have  the  following  multinomial  distri 
but  ion 
(4.3) 


p(n0- V  — '  n2Tw  =  Vrw|N)  %;.%  I 

0     2TW 


KaJ)-*  ...  (A^/*TV]. 


However,  the  number  of  photons  received  in  the  time  interval  (-T/2,  T/2) 
is  a  random  variable  with  expectation  value  N.   If  the  time  interval  T 
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is  sufficiently  large, then  the  distribution  of  N  is  the  Poisson  distri 
bution  [14] 


-N  -N 

(^)  p(n)  =  e-JL 

N 


where  N  is  the  expectation  of  N.   We  are  generally  interested  in  the 

case  of  large  T,  and  for  channel  capacity  considerations,  in  the  case 
where  T  gets  indefinitely  large.   Thus  for  sufficiently  large  T  we  can 
write  (4,3)  as: 


("-5)  p(no  -no "2tw -  W  °[y  N:.  t)2Tw.][Ao  )n° 

2       l2Tw].-g  ij" 


2 

Making   the  substitution   N  =  T)Q  +  r\]    +   ...    +  T}2TW,    and  N  =    (AQ  +   ...    + 

2N  - 
A2TW  )  N  gives 

p<".  ■  \ 2™  -  w  ■  (A°f-(ArRi'2TW  -^ '-'  4™)R 

no-   •••  HZTW 


2  _.T    -A;  N 


(4.6) 


2TW   (A:  N)    e 


-W 


i=0       r\. 


1=0      Tl.J 
i 
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2  - 

where  m f  =  A;  N.   Each  term  in  the  product  (4.6)  has  the  form  of  the 

Poisson  distribution.   Equation  (4.6)  clearly  shows  that  the  n.  are 

independent  random  variables,  each  with  a  Poisson  distribution  with 

o  - 

mean  m.  =  A.  N. 
i     i 

Summarizing  then,  we  have  found  that  a  bandlimited  signal  of  time 
duration  approximately  T  has  2TW  independent  degrees  of  freedom. 
Every  message  can  be  represented  by  a  set  of  real  positive  numbers 
(Aq,  Ai ,  ...,  A2TW)  with  the  constraint  of  equation  (4.1).   For  each 
of  these  degrees  of  freedom  there  is  a  number  n.  representing  the 
number  of  photons  detected  by  the  receiver  in  the  state  i.   The  nj  are 
each  independent  random  variables  with  the  Poisson  distribution 


P(n|)  = 


n.   -m: 

i 


ni' 


2  - 

where  mj  =  A;  N,  and  N  is  the  expected  value  of  the  total  number  of 

photons  received. 

Figure  2  shows  the  elements  of  the  model  described  in  this  section, 
The  input  to  the  decision  circuit  is  simply  the  set  of  numbers 

n  =  (n  ,  n  ,  ...,  n   ) .   This  means  that  there  were  n.  photons  found 

to  be  in  state  i,  for  i  =  0,  1,  ...,  2TW.   This  input  consists  of  a 

signal  vector 

2  - 
(4.7)  m  =  (m0,  m] ,  ...,  m2TW) ,  where  mj  =  A;  N, 

plus  a  quantum  noise  vector 
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(4.8)  q  =  (q0,  qj,  ....  q2TW) ,  where  q.  =  n 


i  "  mi 


where  q;  =  n?  -  m?.  Thus,  the  input  to  the  decision  circuit  can  be 

represented  as 

(4„9)  n  =  m  +  q, 

where  n.  =  m.  +  q . . 
i    i    i 

Each  rip  being  the  number  of  times  the  receiver  measured  R  to  be 
\;,  is  an  integer.   The  mj  are, in  general,  not  integers.  The  quantum 
noise  is  not  restricted  to  a  particular  set  of  quantized  values  since, 
although  n?  is  an  integer,  m;  is  not. 

As  far  as  channel  capacity  computations  are  concerned,  we  can 
simplify  the  model  of  Figure  2  to  the  model  shown  in  Figure  3<   The 
simplified  model  uses  m  as  its  input  signal  or  message.   This  is  valid 
because  m  is  completely  determined  by  A.  The  vector  q  can  represent 
any  noise  whose  statistics  are  known,  and  the  model  proposed  should  be 
quite  general.   In  what  follows,  however,  we  restrict  our  discussion 
to  the  case  where  only  quantum  noise  is  present. 


Channel  Capacity  Upper  Bound 

Keeping  in  mind  our  somewhat  unusual  notation,  namely  that  the 
signal  is  the  vector  m  =  (mQ,  m, ,  ...,  n^TVp  '  the  quantum  noise  is 
the  vector  q  =  (q0,  qj,  ...,  q2T\rf) »  and  tne  S'Ona1  P'us  quantum  noise 
is  the  vector  n  =  m  +  q  =  (nQ,  n^ ,  ....  n2TW) ,  we  will  now  derive  an 
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upper  limit  for  the  channel  capacity  of  the  model  of  Figure  4.  The 
general  formula  for  the  channel  capacity  is  given  by  Shannon  [15]  as 


(4.10)  C=  lim  -  Max(H(n)  -Hm(n)), 

T-»°°  ^ 

where 

(4.11)  Hn  =  -V  p(n)  log  p(n) 

n 

and 

(*.J2)  Hm(n)  =  -Vp(m)  Vp(n|m)  log  p(n|m)   . 


m 


H(n)  is  the  entropy  of  n,  the  signal  plus  quantum  noise.  Hm(n)  is  the 
conditional  entropy  of  n  given  the  signal  m. 

The  difficulty  in  getting  an  exact  solution  for  equation  (4.10)  is 
a  result  of  the  fact  that  the  signal  and  noise  are  not  independent. 

The  channel  capacity  of  any  discrete  channel  can  be  no  greater  than 
the  entropy  rate  of  the  received  signal.  We  will,  therefore,  use  this 
entropy  rate  of  the  received  signal  as  an  upper  bound  of  the  channel 
capacity.   That  is,  we  will  use  the  fact  that  if  the  processes  involved 
are  all  discrete  as  in  this  case,  the  C  of  equation  (4.10)  is  always  less 
than  or  equal  to 


33 


(4.13)  HR  =  Urn  1  Max  /  v  H(n) 


To  make  this  computation,  we  use  a  procedure  of  statistical  mechanics 
[16].   Let  U  be  the  number  of  distinguishable  ways  that  N  photons  can  be 
placed  in  2TW  +  1  compartments.   Thus,  if  N  photons  were  received,  then  U 
would  be  the  number  of  different  signals  which  could  be  received*   In 
Figure  k   there  are  N  photons,  each  represented  by  an  X.   2TW  lines  are 
required  to  divide  the  N  photons  into  2TW  +  1  compartments.   The  arrange- 
ment shown  in  Figure  k   represents  the  case  where  the  zeroth  compartment 
has  three  photons;  therefore,  v\q   =  3 .  The  first  compartment  has  one 
photon;  therefore,  n.  =  1,  et  cetera.   The  vector  represented  by  Figure  k 
is,  therefore,  n  =  (3,  1,  0,  3,  . ..,  2).   The  number  of  possible  ways  to 
arrange  N  +  2TW  symbols  is  (N  +  2TW) J   However,  since  there  are  two  groups 
of  indistinguishable  symbols,  i.e.,  the  N  X's  and  the  2TW  lines,  the 
number  of  distinguishable  arrangements  of  these  symbols  is 


(4.14)  _  (N+  2TW)i 

NJ  (2TW).' 


This  is  the  same  as  the  number  of  different  vectors  n  =  (n  ,  n  , 
no-ru)  which  can  be  formed  with  the  constraint 

2TW 


I 

1-0 


n;  =  N. 
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The  entropy  of  such  a  system  is  a  maximum  if  each  distinguishable 
event,  i.e.,  each  possible  received  signal,  is  equally  likely,  and  this 
entropy  is  given  by 

(4.15)  H  =  log  U  =  log  (N  +  2TW)  .'  -  log  Ni  -  log  (2TW) .'   . 

Stirling's  formula  states  for  a  large  argument  Z  we  can  approximate 
log  Zo1  as 

log  Z!  ^  Z  log  Z  -  Z  . 

Since  we  are  interested  in  the  entropy  rate  which  is  obtained  from  H  as T 
and  N  approach   infinity,  we  can  use  Stirling's  formula  in  (4.15), 
gi  ving 

H  =  (N  +  2TW)  log(N  +  2TW)  -  (N  +  2TW)  -  N  log  N  +  N 

-  2TW  log  2TW  +  2TW 

(4.16)   H  =  N  log  N  +  2TW  +  2TW  log  N  +  2™  . 

N  2TW 

Equation  (4.16)  states  that  if  N  photons  are  received  in  a  time  interval 
of  length  T  and  bandwidth  W,  then  the  amount  of  information  (in  bits,  for 
example,  if  the  logarithm  base  is  2)  which  can  be  received  is  given  by  H. 
The  entropy  rate  is  defined  as 

HR  =  1 im  I  H 
T_*oo  T 

=  1  im  {  H  log  N  +  2™  +  2W  log  tj  +  2TW  j  , 
T->°°   T        N  2TW 
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(4.17)     HR  =  p  log  (1  +  |W)  +  2W  log  (^+1)  , 


where 


p  -  lim} 


The  quantity  p  is  the  average  number  of  photons  per  second  in  the  signal. 
The  quantity  Hr  is  the  entropy  rate  of  the  received  signal  and  may  be 
measured  in  bits  per  second.   It  is  the  maximum  entropy  rate  which  the 
received  signal  may  have.   If  there  were  no  quantum  noise,  this  HD  would 
be  the  channel  capacity.   Since  there  is  quantum  noise,  we  know  that  the 
channel  capacity  must  be  less  than  HR)  and  we  use  HR  as  an  upper  limit 
for  the  channel  capacity  C,  i.e., 


C±HR 


As  a  further  simplification,  we  write  (4.17)  as 


(4.18)       H„  =  2W  {  E-  log  (1  +  QL)    +  log(E-+  1)} 
R       2W         p        2W 


Making  the  substitution  d  =  p/2W  gives 

(4.19)  HR=2WC(d), 
where 

(4.20)  C(d)  =  d  log  ( 1  +  Jf)  +  log  (d  +  1). 

The  quantity  d  is  recognized  as  the  average  number  of  photons  per  second 
per  degree  of  freedom.  The  quantity  HR  is  the  channel  capacity  of  a 
discrete  channel  when  no  quantum  noise  is  present.   Equations  (4.18)  and 
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(4.19)  are  similar  to  equations  found  by  Stern  [3]  and  by  Gordon  [2] 
using  a  different  technique.   Gordon  noted  that  the  first  term  in  (4.18) 
was  the  dominant  term  when  d  is  small  and  is,  therefore,  of  fundamentally 
quantum  origin.   For  large  d  the  second  term  predominates.  A  plot  of 
C(d)  vs  d  is  shown  in  Figure  5.  We  refer  to  C(d)  as  the  normalized  upper 
bound  for  the  channel  capacity. 

Up  to  this  point,  we  have  dealt  only  with  video  signals,  i.e., 
signals  whose  spectra  a(co)  were  either  adjacent  to  zero  or  centered  on 
zero.   Both  of  these  cases  actually  represent  the  same  physical  phenomenon, 
although  different  interpretations  are  required  for  each  case.   One  case 
of  great  interest  is  that  of  a  signal  whose  spectrum  is  bandlimited  to 
higher  frequencies  which  are  not  near  zero,  i.e.,  bandpass  signals. 
That  the  results  found  in  the  video  case  can  be  extended  to  bandpass 
functions  is  the  subject  of  Appendix  B.  We  simply  state  here  that  the 
bound  given  by  equation  (4.19)  applies  to  any  signal  whose  spectrum  is 
zero  outside  of  some  band  of  positive  real  frequencies  of  width  W  cycles 
per  second. 

If  a  narrow  band  signal  is  centered  on  the  frequency  v,  then  we  can 
say  that  any  photon  received  over  the  channel  has  energy  approximately 
hv.   Since  the  number  of  photons  received  per  second  is  p,  the  average 
received  power  P  is 

P  =  phv  =  2Wdhv, 
and 


(4.21)  d  = 


_  P 


2Whv 
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Making  this  substitution  in  (4.18)  and  (4.19)  gives  an  expression  for  the 
upper  bound  HR  of  the  channel  capacity  in  terms  of  the  average  received 
power  P„  This  expression  is 

("-22)  HK-2wl2^,09(l+2Tn;>  +  ,°s<2kr+,))- 

Figure  6  shows  a  plot  of  HR  vs  P  for  W  =  10^  cycles  per  second  and  for 
several  different  values  of  center  frequency  v.   The  expected  center 
frequency  range  for  laser  type  devices  is  from  about  1012  cps  to  lO1^ 
cps,  and  this  range  of  frequency  '^as  been  shown  on  the  graph. 

It  is  important  to  remember  that  the  entropy  rate  Hr  is  actually 
the  channel  capacity  of  a  noiseless  light  channel  quantized  as  in  our 
model  in  accordance  with  the  fact  that  electromagnetic  energy  is  not 
continuously  variable  but  exists  in  discrete  quanta.   If  there  were  no 
uncertainty  in  a  quantized  signal  then  HR  would  be  the  channel  capacity. 
HR  does  not  include  the  effect  of  quantum  noise  which  stems  from  the 
uncertainty  principle  and  is,  therefore,  only  an  upper  limit  to  the 
actual  channel  capacity. 


Channel  Capacity  Lower  Bound  for  Large  d 

In  this  section  we  will  derive  a  lower  bound  for  the  channel 
capacity.  This  bound  is  valid  for  large  d  only,  although  this  restric- 
tion to  d  »  1  will  be  discussed  after  the  bound  has  been  derived. 
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The  derivation  is  based  on  the  inequality 

(4.32)    Maxp(n)  [  H(n)  -  H|n(n)  ]  >  Maxp(n)  H(n)  -  Maxp(n)  Hm(n)  , 

the  left-hand  side  of  which  is  part  of  equation  (4.10).   The  Max p(n)H(n) 
has  already  been  found  to  be  the  H  of  equation  (4.15).   We  find  the  lower 
bound  by  over  bounding  Max  ,   .H   (n)  which  preserves  the  inequality  in  (4.23) 

(*•»)  Hm(n)  =^p(m)  H(n|m)  , 

m 

where  H(n|m)  is  the  entropy  of  n,  the  signal  plus  noise,  when  the  signal 
m  is  given,  which  is  the  sum  of  the  entropies  of  n;  given  mj ,  namely, 

2TW 

(4.25)  H(n|m)  =  Y  H(n,|m|). 

i=0 
The  H(n||mj)  are  defined  as 

(4.26)  H(nf  |mf)  =  \   p(n?|mj)  log  p(n;[m;). 

nf 

Since  p(n.|m.)  Is  a  Poisson  function  with  mean  mj,  it  has  variance  m- , 
also.   The  form  of  p(n;|mj)  which  maximizes  (4.26)  subject  to  the  con- 
straint that  the  variance  is  fixed  at  m;,  is  Gaussian  [17].  Letting 
p(nj|mj)  be  a  Gaussian  density  with  variance  m-    in  equation  (4.26)  gives 
H(n.|mj)  =  log  <J 2 nem f .   We  know  that  the  H(n||m;)  of  (4.26)  can  be  no 
greater  than  this.  Therefore, 


kl 


(^•27)  H(nj|mf)   1  ]ogN/2rtenij    . 

Putting  this  into  (4.25)  gives 

2TW 
(4.28)  H(n|m)  <V  log^/"2^m~i  =  I  log  (2Jte)2TW  +  '  mom, . . .m2TW  . 

Substitution  of  (4.28)  back  into  (4.24)  results  in  the  inequality 

(^•29)  Hm(n)  <  V  p(m)  I  log  (2m)' 


2TW  +  1 


m0mr-'m2TW  ' 


m 


where  p(m)  is  the  joint  density  function  p(m  ,  m, ,  . . . ,  m_  ) .  Now  since 

0   1        2TW 

2  m.  =  N,  the  right-hand  side  of  (4.29)  is  a  maximum  if  p(m)  is  a  2TW  +  1 
dimensional  delta  function  at  mQ  =  m,  = =  m2TW  =  - — .  Therefore, 

fc  I  W   "T"   I 

M"p(n)Hm(n)^}lo3(2,aIJ_)2™+l, 

or, 

(4.30)       Max,*  H  (n)  *  2TW  +  !  log  liM  . 
P(n)  mv       2       2TW+1 


Finally,  from  equations  (4.10)  and  (4.23)  we  derive  our  lower  bound 


CL  as 


C,    =  H      -    1  im  A  {   2TW  +   1    ]og  2,teN     } 
L  R     -r_»oo  T  2  2TW+1 


=  HD    -  W   log    *?£ 
R  w 
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From  (4.19)  and  (4.20) 


(4.31) 


CL  =  2W  [  C(d)  -  I  log  2,ied]  , 


(4.32)  CL  =  2W  [  d  log(l  +  l)+1og  (^d  +  jr-)  -  I  log  2ne], 


The  inequality  (4.23)  from  which  this  lower  bound  is  derived  is  the 
difference  of  two  entropies  as  is  usual  in  computations  involving  channel 
capacity.   However,  the  Maxp(nj  H(n)  is  the  entropy  of  a  discrete  system. 
It  is  the  logarithm  of  U,  the  number  of  distinguishable  ways  to  arrange 
N  photons  into  2TW  +  1  degrees  of  freedom.   From  this  was  subtracted  the 
entropy  of  a  continuous  system  with  a  Gaussian  distribution.   The  entropy 
of  a  continuous  system  is  not,  in  general,  comparable  to  the  entropy  of 
a  discrete  system.   They  are   comparable,  however,  when  d  becomes  large, 
because  then  the  entropy  of  the  discrete  process  approaches  the  entropy 
of  its  continuous  counterpart.   That  the  bound  is  invalid  for  small  d  is 
easily  seen  from  equation  (4.31).   For  d  <  l/2ite,  the  right-hand  term 
becomes  positive,  and  CL,  the  lower  limit,  becomes  greater  than  the 
upper  limit  HR  =  2WC(d).   For  d  greater  than  about  100,  CL  should  be  an 
accurate  lower  bound  for  the  channel  capacity. 

Making  the  substitution  d  =  P/2Whv  into  equation  (4.32)  results  in 
an  expression  for  CL  in  terms  of  the  received  power  P.  Figure  7  is  a 
plot  of  the  upper  bound  HR  and  the  lower  bound  CL  for  a  bandwidth  of  109 
cycles  per  second  at  a  center  frequency  of  10,Z+  cycles  per  second.  The 
plot  cannot  be  extended  to  received  power  levels  less  than  about  10"'° 
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watts,  because  d  becomes  too  small  and  the  lower  bound  C,  does  not  apply, 
In  the  case  illustrated  by  the  figure,  when  P  is  10    watts,  d  is  about 
75. 


Asymptotic  Behavior  of  the  Bounds 
From  (4.20)  we  see  that 


(^-33)  Mm  C(d)  =  log  e  +  log  d 

d-»°° 


Therefore, 


(^•34)  1 im  HR  =  2W  (log  e  +  log  d)  , 

d-)00 

and  from  (4.31) 


(4.35)  1  Im  CL  =  2W  (1  log  d  +  I  log  £_) 

d-»°°         2        2     2it 


As  d  gets  larger,  C  approaches  approximately  1/2  H_.  Thus,  for  large 

d  our  channel  capacity  is  bound  rather  tightly  by  C  and  H  . 

L       R 

An  interesting  form  for  C.  is  found  by  substitution  of  equation 
(4.21)  into  (4.35).   For  large  d  this  gives 


(4.36)  CL  ~  W  log  E__  . 

22  2WhV 
e 

Now,  since  P  is  the  total  power  of  signal  plus  quantum  noise,  i.e., 
P  =  S  +  N,  we  can  write  (4.36)  as 
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(4.37)  C.  -  W  log  ^JL 

L  N 


where  N  =(4rt/e) Whv.  This,  of  course,  is  in  the  form  of  Shannon's 
formula  where  the  quantum  noise  is  represented  by  additive  white 
noise  independent  of  the  signal. 


CHAPTER  5 
THE  INHERENT  AMBIGUITY  OF  SIGNALS 

The  study  of  signals  from  a  quantum  mechanical  point  of  view  raises 
some  questions  about  exactly  what  signals  are  and  how  they  should  be 
represented.   This  chapter,  an  appendage  to  the  main  body  of  this  study, 
presents  a  brief  discussion  of  these  questions  in  very  general  terms. 

Most  previous  work  in  communications  theory  is  based  on  the  concept 
of  a  precise  continuum  of  possible  signals.   That  is,  that  any  signal 
can  be  represented  by  a  precise  function  of  time  s(t),  and  that  s(t)  has 
a  completely  defined  value  for  all  values  of  the  argument  t.  This  is  a 
very  useful  mathematical  model,  for  it  allows  us  to  set  up  a  one-to-one 
mapping  between  the  set  of  all  signals  and  a  certain  class  of  precise 
mathematical  functions.   This  mapping  is  implicit  in  the  statement: 
Let  the  signal  be  s(t).   We  operate  with  these  functions  in  a  rigorous 
mathematical  way  and  determine  laws  and  theorems  which  apply  to  them. 
We  then  intuit  from  the  mapping  process  that  these  laws  also  apply  to 
the  signals  themselves. 

Quantum  theory  tells  us  that  signals  do  not  exist  as  precise 
mathematical  functions  of  time.   It  is  tempting  to  preserve  the  mathe- 
matical model  by  asserting  that  signals  do  exist  as  precise  mathematical 
functions,  and  that  the  difficulty  arises  in  our  inability  to  measure 
them  accurately.  This  is  certainly  not  the  operational  approach.   Signals, 

hi 
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as  we  know  them,  are  inherently  uncertain.  These  general  statements 
apply  to  any  kind  of  time  varying  signals  existing  in  the  real  world 
including  the  electromagnetic  signals  in  which  we  are  most  interested. 

Physical  signals  do  not  exist  as  precise  mathematical  functions  of 
time,  and  there  may  be  a  certain  danger  in  treating  them  as  though  they 
do.   Slepian  recognized  this  danger  and  discussed  it  briefly  in  [18]. 
It  is  not  possible  to  put  a  mathematical  function  precisely  into  the 
form  of  a  physical ly  occurring  s ignal  because  of  the  quantization  of 
physical  phenomena.   Further,  the  measurement  of  any  physical  signal  is 
subject  to  an  irreducible  uncertainty. 

One  attempt  to  circumvent  this  problem  would  be  to  assert  that  there 
is  no  such  thing  as  a  noiseless  signal  and  that  every  signal  must  be 
represented  as  s(t)  +  n(t),  where  n(t)  represents  a  noise  which  is  always 
present.   Such  an  assertion  implies  however,  that  the  mathematical  func- 
tion s(t)  +  n(t)  precisely  describes  a  natural  ly  occum  ng  phenomena. 
Such  cannot  be  the  case. 

Obviously  ?t  is  very  useful  to  represent  a  signal  s  by  a  function 
of  time  s(t),  but  what  does  s(t)  really  mean?  Suppose  our  mathematical 
function  s(t)  has  a  value  v]  at  t,,  i.e.,  s(tj)  =  v..  If  we  measure  the 
actual  signal  at  t,  as  accurately  as  is  physically  possible,  we  would 
get  some  number  v.   Quantum  mechanics  asserts  that  v  is  a  random  variable. 
If  a  number  of  identical  signals  could  be  prepared  and  measured  at  time 
tj,  then  the  results  of  these  measurements  would  not  be  identical.   The 
statement  s(tj)  =  Vj  then  should  mean  only  that  the  expected  value  of 
the  measurement  of  the  signal  at  tj  is  Vj.  Thus,  s(t)  should  be  inter- 
preted as  the  expected  value  of  the  actual  signal  at  time  t.   Obviously, 
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s(t,)  approximates  the  actual  signal  at  t  =  t,  very  well  if  the  natural 
processes  are  such  that  the  variance  of  v  is  small.   Similarly,  s(t) 
may  be  a  very  good  approximation  of  the  signal.  And  when  s(t)  is  a  good 
approximation,  then  valuable  results  are  obtained  through  the  mathe- 
matical analysis  of  continuous  functions.  The  implication  here  is  that 
we  may  not  always  be  justified  in  assuming  that  signals  are  continuous 
and  precise  and  that  such  an  assumption  may  lead  to  erroneous  conclusions, 
The  theory  of  communications  based  on  precise  continuous  signals  appl 
to  signals  macroscopical ly  observed  but  may  fail  to  explain  the  true 
nature  of  signals  when  observed  in  minute  detail. 


les 


CHAPTER  6 

SUMMARY  OF  CONCLUSIONS 

The  purpose  of  this  study  has  been  to  construct  a  reasonable  model 
for  a  quantum  communications  channel  and  to  study  its  properties.   In 
the  model  proposed,  every  signal  which  is  bandlimited  to  a  bandwidth  W 
and  approximately  timelimited  to  the  time  interval  T  has  a  wave  function 
¥(t)  which  can  be  expressed  as 

2TW 
(2.8)  ¥(t)  -V   a.Y.(t)  , 

i=0 

where  the  aj  are  complex  constants  and  the¥j(t)  are  prolate  spheroidal 
wave  functions.  An  observable  quantity  of  such  a  signal  is  represented 
by  the  hermit ian  operator  R  where 

T 

(3.2)  __  )     sin  n(t  -  s) 

R  -  J    «(t  -  s)   l  ]   ds  ■ 

-T 

2 

If  a  measurement  of  the  observable  R  is  performed  on  one  received  photon 
of  a  signal  with  wave  function  ¥(t) ,  then  the  outcome  of  the  measurement 
will  be  one  of  the  eigenvalues  \_,  \,  ,  ...,  ^2tu'   "*"ne  probability  that 

50 


51 


the  measurement  is  \.  Is  A?  for  i  =  0,  1 ,  . . . ,  2TW,  where  A,  is  the 
absolute  magnitude  of  a,.   If  the  measurement  is  performed  on  N  received 
photons,  then  the  number  of  times  \,  is  observed  has  the  expectatii 


on 

for  i  =  fl  Oti.i 


value  N  A?  for  i  =  0,  1,  ... ,  2TW. 


Every  wave  function  is  associated  with  an  ordered  set  of  positive 
real  numbers  which  can  be  written  in  vector  form  as  A  =  (A       A       a   \ 
where 

2TW 

<*•!)  y  a?  =  i  . 

?=0 

Any  A  can  represent  a  message.   In  order  to  transmit  a  message,  the  trans- 
mitter transmits  an  electromagnetic  signal  whose  wave  function  corresponds 
to  this  A.   The  receiver  performs  the  measurement  represented  by  R  on  each 
of  the  received  photons  and,  from  the  results  of  this  observation,  attempts 
to  determine  which  wave  function  and  therefore  which  message  was  transmitted. 

It  is  shown  in  this  study  how  the  description  of  this  model  can  be 
simplified  as  follows.  The  transmitter  transmits  the  signal  m  ■  (m       m 

o'    r 

'*••  m2TVp  wnere 

2T\(/ 
(6.0  V  m,  -  N  , 

1=0 

and  N  is  a  constant.   The  received  signal  is  n  »  fn  ,  n       n   ) 

0   1 2TW  ' 

where  n  =  m  +  q,  and  q  =  (qQ,  q]t    ...,  q   )  fs  noise> 

If  there  is  no  external  or  thermal  noise,  then  q  represents  only 
quantum  noise.   In  this  case  it  is  shown  that  each  n,  of  the  received  signal 
is  a  random  variable  with  a  Poisson  distribution  with  mean  m..  An  upper 
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bound  on  the  channel  capacity  for  a  narrow  band  noiseless  channel  of 
bandwidth  W,  center  frequency  V  and  average  power  P  is  found  to  be 

(6.2)  HR  =  2W  [  d  log(l  +  i)  +  log(d  +  1)]  , 

d 


where  d  =  P/2Whv.  The  quantity  d  is  recognized  to  be  the  average  number 
of  photons  received  per  second  per  degree  of  freedom,  and  h  is  Planck's 
constant,  A  lower  bound  for  this  channel  capacity  which  applies  when  d 
is  large  is 

(4.32)       CL  =  2W  [  d  log(l  +  1)  +  logi^Td  +  j±-)    -  -   log  2jte]  . 

The  channel  capacity  for  a  channel  in  which  there  is  no  external  or 
thermal  noise  lies  somewhere  between  the  two  bounds  given  by  (4.32)  and 
(6.2)  o   Therefore,  if  the  model  presented  in  this  study  represents  electro- 
magnetic communications,  it  will  never  be  possible  to  exceed  the  bound  given 
by  (6.2),  but  it  may  be  possible  under  ideal  conditions  to  attain  the 
capacity  given  by  (4.32). 


APPENDIX  A 
PROLATE  SPHEROIDAL  WAVE  FUNCTIONS 

The  purpose  of  this  appendix  is  to  summarize  briefly  the  origin  and 
characteristics  of  the  prolate  spheroidal  wave  functions  of  Slepian, 
Landau,  and  Pollak  [k,    5,  6]. 

The  set  of  prolate  spheroidal  wave  functions  are  a  countably 
infinite,  complete  set  of  real  functions  ¥.(t),  ¥  (t),  ...  of  the  real 
variable  t,  each  of  whose  Fourier  transform  is  zero  outside  the  interval 
(-Q,ft)  and  which  satisfy  the  following  three  equations  for  T  >  0  and 
fl  >  0: 


(A-i)  /y.(t)  y,(t)  dt  =  s  , 


I 
2 
(A-2)  /I  (t)  *  (t)  dt  =  \   6   , 

"T"  J  '     '  J 


T 
2 

(A-3)  \.y.(t)  =  /  Sin  Q(t  "  »).y  (,)  ds 

11      _j         Jt(t  -  s)     ' 

_2 


The  quantity  8-.  is  the  Kronecker  delta  function.   The  ¥;(t)  and  \;  are 
the  eigenfunct ions  and  eigenvalues,  respectively,  of  equation  (A-3)  from 
which  they  are  derived.  The  eigenvalues  are  real,  non-degenerate,  and 

are  ordered  as  X  >\  >\_  The  \j  are  always  less  than  one.   For 
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given  values  of  ft  and  T  the  \.    fall  off  rapidly  with  increasing  i  once  i 
has  exceeded  ftT/rt.   The  function  ¥((t)  is  either  even  or  odd  according 
as  i  is  even  or  odd  and  has  exactly  i  zeros  in  the  interval  (-T/2,  T/2) . 

From  (A-3)  we  see  that  the  ¥;(t)  are  functions  of  ft,  T  and  t.   In 
most  cases  of  interest,  however,  the  ft  and  T  are  constants  and  t  is  the 
only  variable. 

Slepian  and  Pollak  arrive  at  equation  (A-3)  by  asking  the  following 
question:  What  function  loses  the  least  amount  of  its  energy  when  first 
timelimited  and  then  bandlimited?  The  answer  to  this  question  turns  out 
to  be  the  timelimited  version  of  ¥0(t),  where  ¥0(t)  !s  defined  as  the 
eigenfunction  belonging  to  the  largest  eigenvalue  of  equation  (A-3).   This 
function  was  discovered  earlier  by  Chalk  [19], 

The  functions  ¥.(t)  are  scaled  versions  of  certain  of  the  angular 
prolate  spheroidal  wave  functions  discovered  and  tabulated  earlier  [20, 
21].   Just  as  Bessel  functions  can  be  found  by  the  solution  of  the  wave 
equation  in  cylindrical  coordinates,  the  angular  prolate  spheroidal  wave 
functions  were  found  originally  by  solving  the  wave  equation  in  prolate 
spheroidal  coordinates.   In  what  follows  we  use  the  notation  of  Flammer 
[20]  and  show  briefly  how  the  functions  ¥;(t)  can  be  derived  from  his 
work. 

The  prolate  and  oblate  coordinate  systems  are  two  of  the  eleven 
coordinate  systems  in  which  the  scalar  wave  equation 


2.  ,2, 


(A-k)  (V  N-  k^)  f  =  0 


• 


55 


is  separable.   If  the  wave  equation  is  expressed  in  terms  of  prolate 
spheroidal  coordinates,  then  the  solution  can  be  expressed  in  terms  of 
the  Lame  product,  as  in  Flammer's  equation  (2.2.5), 


(A"5>  fmn  "  Smn(C,Tl)  Rmn(C,£)  Jft  m  0, 

where  T[,t,<t>   designate  the  prolate  spheroidal  coordinates  and  C  is  a 
real  constant  proportional  to  k„  The  functions  Smn(C,T))  are  called 
angular  prolate  spheroidal  wave  functions  and  are  solutions  of  Flammer's 
equation  (2.2.6),  which  is 


(A-6)         <L  [(1  -  il2)  <L  Smn  (c,T|)]  +  EX,„n  -  cV  -  J!lL]  Smn(C,Ti) 
dT)         drj  1  -T)1 

-  0. 


Solutions  of  this  equation  exist  only  for  certain  values  of  the  separation 
constant  \mn.      For  m  =  0  and  C  =  0,  equation  (A-6)  reduces  to  a  form  of 
Legendre's  equation  whose  solutions  are  Legendre's  polynomials.  Therefore, 
the  solutions  S  (CH)  are  called  prolate  spheroidal  wave  functions  only 
for  C2  4   0. 

The  functions  Rmn(C,{;)  are  called  radial  prolate  spheroidal  wave 
functions  and  are  the  solutions  to  an  equation  similar  to  (A-6).   Flammer 
shows  that  Smn(C,T))  and  R  (C,£)  satisfy  several  different  integral  equations. 

These  functions  are   of  interest  to  us  primarily  because  they  satisfy 
the  integral  equation 
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(1) 
(A-7)      Smn  (C,ti)  Rmn  (C£)  cos  m<*  = 


2  it     1 

(4njn)-'  /  /  eJ'y  Smn(C,T)  cos  met)'  dr, '   d*>  , 
0  -1 


where 


y  =  C  fo'nf  +  (1  -  tV2),/2  (1  -  r)2)1/2  (£2  -  1)1/2  cos(<t>  -  *')] 


These  are  Flammer's  equations  (7.1.1)  and  (7.1.2a).   Rmntc>£)  's  callec*  a 
radial  prolate  spheroidal  wave  function  of  the  first  kind.   If  we  sub- 
stitute m  =  0  and  £  =  1  into  (A-7)  we  get 

(1)  2n   ' 

(A-8)      Son(C,n)  Ron(Cl)  =  (^jn)_1  /  /  eJ^  ^  Son(C,iT)  dl)1*1  . 

0  -1 


Making  the  change  of  variable  from  X\   to  T  and  from  r\ '  to  s  and  performing 
the  integration  in  <t> '  gives 

m  ]    • 

(A-9)      2jn  R;n;(C,l)  Son  (C,t)  =  /  eJCTs  Son(C,s)  ds  . 

This  equation  is  valid  for  n  =  0,  1,2  ...,  and  is  equation  (25)  in  Slepian 
and  Pollak  [h].      The  iterate  of  (A-9)  is  found  by  multiplying  both  sides 
by  e"-1    and  integrating  from  T  =  -1  to  T  =  1  .  This  is  done  as  follows; 

2jR  R(0(C,1)  /  e"jCTt  S   (C.T)dT-/  /  e"JCr(,"t)  S   (C,s)dTds, 
on       -        on  -i-i  on 
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2j^>(C,l)[2j"R(;»(C,.).Son(C;.-t)]'=_/  Son(C,s)  *■  ^rfp)  d.. 


(A-10)   2j2"  [R(^(C,1)J2  Son(C,-t)  .  /  'fa  C(t  •  .)  $on  (c>s)  dj 


Examination  of  the  series  expansion  of  S  (C,t),  Flammer's  equati 


on 


(3.4.1),  shows  that  Son(C,t)  is  even  or  odd  when  n  is  even  or  odd, 
respectively.  This  means  that  j  S  (C,-t)  =  S  (C,t),  and  we  can 
write  (A-10)  as 


<*-»)   r[  f&hcM*  »„(e.t)  -}  nffi  sy>  son(c,s)  ds , 


which  is  equation  (24)  of  Slepian  and  Pollak  [4]. 
Finally,  defining 

(A-12)  \n(C)  =  |£  [  R^C.J)]2 

and 


2\   (C)  ., 

=  /-! D —    *on«'Th 

T  /  [Son(C,t)]  dt 


and  making   the  appropriate  change  of  variables   converts    (A-9)    and 
(A-ll)    into 

(A-14)  jnfl     Ron   (C,l)   y  (c>t)   =    1       jnejUt  $   (CfWT)    du 

n  n  -n  n       2fl 
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and  T 


(A-3)  Vn(t)  -J  im_Mt_Z_ll^(s)  ds 


2 


The  orthogonality  of  the  set  ¥Q(t),  ¥,(t),  ...  over  the  two  different 
time  intervals  (-00,00)  and  (-1/2,    T/2)  as  stated  in  equations  (A-l)  and 
(A-2)  is  proved  from  equation  (A-3)  in  Slepian  and  Pollak  [4]. 

The  utility  of  the  functions  ¥;(t)  is  embodied  in  the  fact  that  they 
are  the  eigenvalues  of  the  finite  Fourier  transform  kernel  as  illustrated 
by  equation  (A-14).   It  has  long  been  recognized  that  the  eigenfunct ions 
and  eigenvalues  of  (A-3)  are  important  in  applied  mathematics  [19,  22]. 
The  fact  that  the  ¥.(t)  were  first  derived  from  the  solutions  of  the  wave 
equation  in  prolate  spheroidal  coordinates  seems  only  incidental.   It  may 
be  unfortunate  that  these  functions  have  been  given  the  f righteningly 
restrictive  name  of  prolate  spheroidal  wave  functions.   It  is  evident 
that  these  functions  have  great  applicability  in  the  field  of  communications 
theory  and  in  other  fields  where  prolate  spheroids  are  conspicuously  absent. 
Their  use  in  antenna  theory  itself,  the  field  of  study  which  led  to  their 
development,  is  even  broader  than  their  name  suggests  [23]. 


APPENDIX  B 


EXTENSION  TO  THE  NARROW  BAND  CASE 


Most  of  the  previous  analytical  work  in  the  body  of  this  thesis 
has  dealt  with  video  type  functions,  i.e.,  functions  whose  spectrum 
a(w)  exists  only  in  one  interval  centered  on  co  =  0,  or  other  functions 
derived  directly  from  these.   It  is  the  purpose  of  this  appendix  to 
show  that  the  results  found  in  Chapter  k   for  the  video  case  are  valid 
also  for  the  bandpass  case. 

The  prolate  spheroidal  wave  functions  are  all  limited  to  the 
bandwidth  (-fl,fl).   We  have  shown  in  equation  (3.10)  that  a  function 
whose  spectrum  is  zero  outside  the  interval  (0,q)  can  be  represented 
as  a  sum  of  prolate  spheroidal  wave  functions  and  that  such  a  function 
is  complex.   Thus,  the  function  Y(t)  whose  spectrum  a(co)  is  zero  out- 
side (0,ft)  can  be  represented  by  the  equation 


(3.4)  f(t)  =  *(t)  +  j  $(t)  , 


where  <t>(t)  is  a  real  function  whose  spectrum  is  zero  outside  (-&,o,) 
and  <l>(t)  is  the  Hilbert  transform  of  *(t).   Of  course,  the  spectra  of 

A 

<t>(t)  and  j  <t>(t)  combine  in  such  a  way  that  the  spectrum  of  <t>(t)  + 
j  4>(t)  exists  only  in  the  interval  (0,0,). 

It  will  be  remembered  that  we  chose  to  use  functions  with  one- 
sided spectra,  like  ¥(t)  in  (3.4),  primarily  because  of  the  quantum 
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mechanical  interpretation  of  |a(w)[   as  a  probabilty  density.  This  con- 
ventional approach  led  us  into  complex  wave  functions  whose  real  and 
imaginary  parts  were  Hilbert  transforms.   We  could  have  used  double- 
sided  spectra  with  the  interpretation  that  2|a(tt)|   was  a  probabilty 
density  function  for  positive  w  only.   Using  this  approach,  we  could 
have  simply  ignored  |a(to)[   for  negative  w,  and  this  approach  would 
have  led  us  to  exactly  the  same  conclusions  and  results  as  the  scheme 
used.   Of  course,  we  would  have  specified  that  the  two  sides  of  the 
spectrum  were  complex  conjugates,  and  real  wave  functions  would  have 
resul ted. 

In  what  follows  we  show  that  a  function  ¥(t)  whose  Fourier  trans- 
form a(w)  exists  only  in  the  interval  (un,  u>_  +0.)  can  be  represented 
in  terms  of  a  sum  of  2TW  +  1  functions  of  the  form  ¥j(t)eJ    where 
the  ¥;(t)  are  the  PSW  functions.  The  relationship  between  ¥(t)  and 
a  ( u>)  is 

UU+   ft 

00  0 

(B-l)  ¥(t)   =  —    /  a(cj)eJut   du  =  1-/  a(u))eJwt   dco  . 

2n  _oo  2jt  ojq 

Making  a  change  of  variable  to  to1  =  w  -  cj_  results  in 


(B-2)        ¥(t)  =    /  av(w,)e,U't  dco'  e-""^  , 

0 


where  a  (to)  is  the  video  vers  ion  of  a  (w)  ,  i.e.,  ay(w)  =  a(w  +  (Oq)  . 

Using  the  same  reasoning  which  resulted  in  (3»*0,  we  can  express 
T(t)  as 
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(B-3)  ¥(t)  =  (4>(t)  +  J  $(t)}  eJW^ 


where  4>(t)  is  a  real  function  whose  Fourier  transform  a(co)  is  defined  by 

(3.5)  a(u)  ■  1/2  av(w)    u  >  0 

=  0  co  =  0 

=  1  /2  a*(  -co)    co  <  0   . 

Thus,  the  spectrum  of  <J>(t)  is  zero  outside  of  the  interval  (-(2, ft),  and 

$(t)  can,  therefore,  be  expressed  as  a  sum  of  2TW  +  1  prolate  spheroidal 

functions  with  real  coefficients.   As  before,  0(t)  denotes  the  Hilbert 
transform  of  <t>(t).   The  spectrum  of  <l>(t)  is 

(3.6)  «,(«)  =  -  1/2  jav(co)    co  >  0 

=  0  w  =  0 

=  1/2  JaJ(-w)    co  <  0  . 

Once  the  function  <t>(t)  is  known,  then  4>(t)  can  be  found  by  the 

relationship  (3.6).   Since  the  spectrum  of  4>(t)  is  zero  outside  of 

(-ft, ft),  it  can  also  be  expressed  as  a  sum  of  prolate  spheroidal  wave 
functions.   Therefore,  equation  (B-3)  can  be  written  as: 

00 

(B-4)  ¥(t)  -{V  A.ej0i  f,(t)}  eJU0t  , 

i=0 

or 

00 


(B-5)  *(t)  -V   a.^.(t)  ejV 


i=0 
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This  shows  that  a  ¥(t)  whose  spectrum  is  zero  outside  of  the  interval 
(u)q,  o)q  +  ft)  can  be  represented  as  a  sum  of  functions  of  the  form 
¥.(t)e    .  All  of  the  analyses  for  the  video  functions  treated  in  the 
main  body  of  this  thesis  are   based  on  the  fact  that  the  prolate  spheroidal 
wave  functions  satisfied  equations  (A-l),  (A-2) ,  and  (A-3).  The  functions 
¥.(t)e  0  satisfy  a  similar  set  of  equations  and,  therefore,  all  analyses 
and  conclusions  which  apply  to  video  signals  apply  also  to  bandpass 
functions  whose  spectrum  is  zero  outside  of  any  continuous  interval  of 
length  ft.  These  important  equations  for  the  functions  ¥-(t)eJ  0  are 

00 

(B-6)  /  (Y,(t)  eJw0t)(Yj(t)eJw0t)"  dt  =  8,j  , 


T 
1 

(B-7)  /  (Ti(t)eJW0t)(Tj(t)e-JU,0t)"  dt  =  X.S..      , 

_T 
2 

T 
(B-8)  \F(YF(t)ej(JOt)   =  /  SiaJ&t   ~s)   e  J"0(t-s)  ^  (s)(J  wnsj    dj 

T 

2 


«(t  "  s) 


As  usual,  the  asterisk  denotes  the  complex  conjugate.   These  equations 

are  easily  verified  from  equations  (A-l)  through  (A-3).  The  observable 

R  is  represented  by  the  operator 

T 
I 


(B-9)  R  =  /§?"(t(t-s")S)  ^"^  >  * 


.r(t  -  s) 
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Expressed  in  matrix  form  R  is  given  by  equation  (3.16)  for  both  the 
video  and  bandpass  cases. 

Furthermore,  substitution  of  the  functions  Ufj(t)eJ^)  for  Yf(t) 
in  equation  (2.6)  shows  that  the  theorem  of  Landau  and  Pollak  stated 
in  Chapter  2  applies  to  these  bandpass  functions  as  well  as  the  video 
functions.  Therefore,  if  the  spectrum  of  a  function  is  limited  to  a 
continuous  interval  ft  =  2jiW,  and  if  the  function  is  approximately 
timelimited  in  the  sense  of  equat ions (2.6)  and  (2.7),  then  the  function 
can  be  represented  in  terms  of  2TW  +  1  functions  ¥.(t)eJ  0  as 

2TW 
(B-10)  ¥(t)  =  V  a^i(t)  e-lV  . 


i  i 
1-0 

We  have  shown,  then,  that  bandpass  functions  can  be  treated  exactly 
as  the  video  functions,  and  therefore,  the  procedures  and  conclusions 
of  the  body  of  this  paper  apply  equally  well  to  the  bandpass  case. 
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